Binary Searching with Non - uniform

نویسندگان

  • Gonzalo Navarro
  • Eduardo Fernandes Barbosa
  • Ricardo Baeza-Yates
  • Walter Cunto
  • Nivio Ziviani
چکیده

We study the problem of minimizing the expected cost of binary searching for data where the access cost is not xed and depends on the last accessed element, such as data stored in magnetic or optical disk. We present an optimal algorithm for this problem that nds the optimal search strategy in O(n 3) time, which is the same time complexity of the simpler classical problem of xed costs. Next, we present two practical linear expected time algorithms, under the assumption that the access cost of an element is independent of its physical position. Both practical algorithms are online, that is, they nd the next element to access as the search proceeds. The rst one is an approximate algorithm which minimizes the access cost disregarding the goodness of the problem partitioning. The second one is a heuristic algorithm, whose quality depends on its ability to estimate the nal search cost, and therefore it can be tuned by recording statistics of previous runs. We present an application for our algorithms related to text retrieval. When a text collection is large it demands specialized indexing techniques for eecient access. One important type of index is the suux array, where data access is provided through an indirect binary search on the text stored in magnetic disk or optical disk. Under this cost model we prove that the optimal algorithm cannot perform better than (1= log n) times the standard binary search. We also prove that the approximate strategy cannot, on average, perform worse than 39% over the optimal one. We connrm the analytical results with simulations, showing improvements between 34% (optimal) and 60% (online) over standard binary search for both magnetic and optical disks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multilevel Input Ring-Tcm Coding Scheme: a Method for Generating High-Rate Codes

The capability of multilevel input ring-TCM coding scheme for generating high-rate codes with improved symbol Hamming and squared Euclidean distances is demonstrated. The existence of uniform codes and the decoder complexity are also considered.

متن کامل

TOPOLOGY OPTIMIZATION OF PLANE STRUCTURES USING BINARY LEVEL SET METHOD AND ISOGEOMETRIC ANALYSIS

This paper presents the topology optimization of plane structures using a binary level set (BLS) approach and isogeometric analysis (IGA). In the standard level set method, the domain boundary is descripted as an isocountour of a scalar function of a higher dimensionality. The evolution of this boundary is governed by Hamilton–Jacobi equation. In the BLS method, the interfaces of subdomai...

متن کامل

Essentially Non-Oscillatory Adaptive Tree Methods

We develop high order essentially non-oscillatory (ENO) schemes on non-uniform meshes based on generalized binary trees. The idea is to adopt an appropriate data structure which allows to communicate information easily between unstructured data structure and virtual uniform meshes. While the generalized binary trees as an unstructured data structure can store solution information efficiently if...

متن کامل

BQIABC: A new Quantum-Inspired Artificial Bee Colony Algorithm for Binary Optimization Problems

Artificial bee colony (ABC) algorithm is a swarm intelligence optimization algorithm inspired by the intelligent behavior of honey bees when searching for food sources. The various versions of the ABC algorithm have been widely used to solve continuous and discrete optimization problems in different fields. In this paper a new binary version of the ABC algorithm inspired by quantum computing, c...

متن کامل

Learning Good Variable Orderings

Variable ordering heuristics are used to reduce the cost of searching for a solution to a constraint satisfaction problem. On real problems that have non-binary and non-uniform constraints it is harder to make the optimal choice of variable ordering because surprisingly little is known about when and why variable ordering heuristics perform well. In an attempt to address this problem we present...

متن کامل

Nonlinear inversion of potential-field data using a hybrid-encoding genetic algorithm

The genetic algorithm is of advantages to solve an inversion of complex non-linear geophysical equations. Its multi-point searching is able to find the globally optimal solution and avoid falling into a local extremum. The searching efficiency of the genetic algorithm is a key to successfully resolve a geophysical inversion problem in a huge model space with multi-parameters. Encoding mechanism...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998